Policy Improvement for several Environments

نویسندگان

  • Andreas Matt
  • Georg Regensburger
چکیده

In this paper we state a generalized form of the policy improvement algorithm for reinforcement learning. This new algorithm can be used to ...nd stochastic policies that optimize single-agent behavior for several environments and reinforcement functions simultaneously. We ...rst introduce a geometric interpretation of policy improvement, de...ne a framework to apply one policy to several environments, and propose the notion of balanced policies. Finally we explain the algorithm and present examples.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Policy Improvement for several Environments Extended Version

In this paper we state a generalized form of the policy improvement algorithm for reinforcement learning. This new algorithm can be used to ...nd stochastic policies that optimize single-agent behavior for several environments and reinforcement functions simultaneously. We ...rst introduce a geometric interpretation of policy improvement, de...ne a framework to apply one policy to several envir...

متن کامل

Approximate Policy Iteration for several Environments and Reinforcement Functions

We state an approximate policy iteration algorithm to find stochastic policies that optimize single-agent behavior for several environments and reinforcement functions simultaneously. After introducing a geometric interpretation of policy improvement for stochastic policies we discuss approximate policy iteration and evaluation. We present examples for two blockworld environments and reinforcem...

متن کامل

Intra Sector Policy Interventions for Improvement of Iranian Health Financing System

Background and purpose: To determine an appropriate financial model for the health system of Iran, several studies have been conducted. But it seems that these studies were not comprehensive and further investigation is required. So to design a valid and enforceable mechanism, the study of policy interventions will be considered through consensus of all stakeholders. This investigation was done...

متن کامل

How Neoliberalism Is Shaping the Supply of Unhealthy Commodities and What This Means for NCD Prevention

Alcohol, tobacco, and unhealthy foods contribute greatly to the global burden of non-communicable disease (NCD). Member states of the World Health Organization (WHO) have recognized the critical need to address these three key risk factors through global action plans and policy recommendations. The 2013-2020 WHO action plan identifies the need to engage economic, agricultural and other relevant...

متن کامل

Multisectoral Actions for Health: Challenges and Opportunities in Complex Policy Environments

Multisectoral actions for health, defined as actions undertaken by non-health sectors to protect the health of the population, are essential in the context of inter-linkages between three dimensions of sustainable development: economic, social, and environmental. These multisectoral actions can address the social and economic factors that influence the health of a population at the local, natio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001